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Abstract 


Coronaviruses pose serious health threats to humans and other animals. 
Understanding the mechanisms of their replication has important implications for 
global health and economic stability. Nonstructural protein 9 (nsp9) is an essential 
RNA binding protein for coronavirus replication. However, the mechanisms of the 
dimerization and nucleic acid binding of nsp9 remain elusive. Here, we report four 
crystal structures, including wild-type porcine delta coronavirus (PDCoV) nsp9, 
PDCoV nsp9-AN7 (N-terminal 7 amino acids deleted), wild-type porcine epidemic 
diarrhea virus (PEDV) nsp9, and PEDV nsp9-C59A mutant. These structures reveal 
the diverse dimerization forms of coronavirus nsp9. We first find that the N-finger of 
nsp9 from PDCoV plays a critical role in dimerization. Meanwhile, PEDV nsp9 is 
distinguished by the presence of a disulfide bond in the dimer interface. Interestingly, 
size-exclusion chromatography and analytical ultracentrifugation analyses indicate 
that the PDCoV nsp9-AN7 and PEDV nsp9-C59A mutants are monomeric in solution. 
In addition, electrophoretic mobility shift assays and microscale thermophoresis 
analysis indicate that the monomeric forms of PDCoV nsp9 and PEDV nsp9 still have 
nucleic acid binding affinity, but it is lower than that of the wild type. Our results 
show that the diverse dimerization forms of coronavirus nsp9 proteins enhance their 


nucleic acid binding affinity. 


Importance 


Coronaviruses cause widespread respiratory, gastrointestinal, and central nervous 
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system diseases in humans and other animals, threatening human health and causing 
economic loss. Coronavirus nsp9, a member of the replication complex, is an 
important RNA binding subunit in the RNA-synthesizing machinery of all 
coronaviruses. However, the mechanisms of the dimerization and nucleic acid binding 
of nsp9 remain elusive. In the study we have determined the nsp9 crystal structures of 
PDCoV and PEDV. We first find that the N-finger of nsp9 from PDCoV plays a 
critical role in dimerization. Meanwhile, PEDV nsp9 is distinguished by the presence 
of a disulfide bond in the dimer interface. This study provides a structural and 
functional basis for understanding the mechanism of dimerization and show that the 
diverse dimerization modes of coronavirus nsp9 proteins enhance their nucleic acid 
binding affinity. Importantly, these findings may provide a new insight for antiviral 


drug development. 


Introduction 


Coronaviruses (CoVs) are enveloped viruses, with single-stranded RNA genomes 
of positive polarity approximately 30 kb in length, that can be divided into four 
genera: a-CoV, B-CoV, y-CoV and 5-CoV (1, 2). The ideal hosts of a-CoV and B-CoV 
are mammals, and y-CoV primarily infects birds, while 5-CoV has been identified in 
both mammals and birds (3). In the past 15 years, there have been two epidemics of 
fatal pneumonia in addition to other outbreaks caused by CoVs. The severe acute 
respiratory syndrome CoV (SARS-CoV) emerged in China in 2002, and the Middle 


East respiratory syndrome CoV (MERS-CoV) emerged in Saudi Arabia in 2012 (4-6). 
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In addition, the PEDV and the PDCoV, two re-emerging and emerging epizootic 
swine viruses, have caused major economic losses in Asia and the United States (7-9). 
CoVs encode the ORF la and ORF lab polyproteins, which are then processed 
into 15-16 nonstructural proteins (nsps) by two virus-encoded proteinases, the 
papain-like and 3C-like proteinases (10-14). All these nsps, except for nsp1 and nsp2, 
are considered essential for transcription, replication and translation of the viral RNA 
(15-17). Nsp9 with nsp7, nsp8 and nsp10 localizes within the replication complex and 
is likely a member of the replication complex (18). The deletion of nsp9 in MHV 
prevents RNA synthesis and productive virus infection, while fusion of a nsp9-10 
oligoprotein results in viability but attenuated growth, indicating that the mature form 
of the nsp9 protein is critical for viral replication (19). Another study showed that 
nsp9 is a nucleic acid binding protein that is essential for replication (20). Meanwhile, 
the dimerization of nsp9 is critical for viral replication (21). For nsp9, there are five 
crystallographic structures that showed a variety of dimeric interfaces (22-25). 
However, the dimerization mechanism of all CoV nsp9s, whether from emerging 
viruses or not, is not so clear yet. 
In this study, we elucidated the crystal structures and functions of PDCoV nsp9 
and PEDV nsp9. The critical amino acids or motifs for nsp9 dimerization and nucleic 


acid binding were further investigated. 


Results 


Overall structure of the nsp9 
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The PDCoV nsp9 and PEDV nsp9 structures are refined to 1.80 A and 2.89 A 
resolution, respectively (Table 1). Their space groups are P2; and P64, respectively. 
With the exception of the regions encompassing amino acids 72-79, and 106-109 of 
PDCoV nsp9 and 1-6, 33, 55-58, and 106-108 of PEDV nsp9, all residues of both 
nsp9 proteins can be built in the final models (Fig. 1A, 1B). The crystal structures of 
PDCoV nsp9 and PEDV nsp9 reveal that their monomers contain seven antiparallel 
B-strands and one a-helix appended at the C-terminus of the polypeptide chain 
(residues 91-103), as well as loops connecting the B-strands (Fig. 1A, 1B). All seven 
B-strands form a B-sandwich, with sheet 61-3 interleaved between sheets B4-5 and 


66-7 (Fig. 1C). 
The N-finger plays an important role in the dimerization of PDCoV nsp9 


Based on the PDCoV nsp9 crystal structure, the dimer interface is formed by the 
N-fingers and the parallel association of the C-terminal a-helices. A total surface area 
of 924.3 A’ is buried upon dimerization (Fig. 2A). Although the N-terminus of nsp9 
may be flexible in other coronaviruses, we see clear electron density for this region in 
the 2Fo - Fc map (Fig. 2B). One hydrogen bond forms between the Leu4 main chain 


of subunit B and the Arg70 main chain of subunit A, and one hydrogen bond forms 


between the Arg7 main chain of subunit A and the Ser103 side chain of subunit B (Fig. 


2C, 2D). Leu4 and Arg7, which form part of the N-finger (Fig. 1D), clip onto the edge 
of the inner B-sheet (Arg70 on strand 6) and the edge of the a-helix (Ser103) from its 


dimer partner (Fig. 2C, 2D). Further stabilization is derived from the close packing of 
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the helices from the two monomers because the heart of the dimer interface consists 
of two glycines (Gly96, Gly100), which form a hydrophobic interface and stabilize 
the dimer (Fig. 2C, 2D). 

To determine whether the PDCoV nsp9 dimer found in the crystal is also present in 
solution, we performed size-exclusion chromatography (SEC) and analytical 
ultracentrifugation (AUC) experiments to confirm the oligomeric state of PDCoV 
nsp9 in solution. Our data show that PDCoV nsp9 eluted in two peaks with calculated 
molecular masses of approximately 12.64 kDa and 25.33 kDa, corresponding to a 
monomer and a dimer, respectively (Fig. 2E). Meanwhile, the AUC data, which are 
consistent with the SEC results, show that the PDCoV nsp9 protein existed in both 
monomeric and dimeric states in solution (Fig. 2F). 

Then, we determine whether the N-finger plays important roles in nsp9 dimer 
formation. The nsp9-AN7 protein was prepared, and SEC and AUC assays were used 
to analyze the oligomeric states of the two mutants. The SEC data show that the nsp9- 
AN7 mutant eluted in only one peak, which corresponded to a monomer (Fig. 2G, 2H). 
Meanwhile, the nsp9-L4A/L6A/R7A/N8A mutant has a monomeric form (Fig. 2G, 
2H). The AUC results reveal a monomeric state of the nsp9-AN7 mutant with a 
calculated molecular weight of 10.94 kDa, a sedimentation coefficient (S20,w) of 1.349, 
and a frictional ratio (fo) of 1.317 (Fig. 2F). In addition, PDCoV nsp9-AN7 is 
crystallized in the space group P2;. The structure is determined by molecular 
replacement using the structure of wild-type PDCoV nsp9 as the search template and 


is refined at 1.99 A resolution (Table 1). The structure of PDCoV nsp9-AN7 remains 
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basically consistent with wild-type PDCoV nsp9 except for the N-finger. The 
structural superposition of the wild-type nsp9 monomer with the nsp9-AN7 mutant 
monomer shows a root mean square deviation (RMSD) of 0.9 A between the 90 Ca 
atoms (Fig. 3). According to these results, the N-finger plays an important role in 
maintaining the dimer stability. 

Meanwhile, the SEC data show that the nsp9-G96E and nsp9-GI00E mutants have 
a monomeric form (data not shown), indicating that the Gly96 and Gly100 of 


GXXXG motif can also impact the dimerization. 


The importance of the dimerization-related amino acids in nucleic acid binding of 


PDCoV nsp9 


Seven mutants, including nsp9-L4A, nsp9-L6A, nsp9-R7A,  nsp9-N8A, 
nsp9-L4A/L6A/R7A/N8A, nsp9-G96E and nsp9-GIO0E, were designed to test 
whether the N-finger and the glycines in the a-helix of PDCoV nsp9 are related to 
nucleic acid binding. Then, two EMSA assays were performed to examine the nucleic 
acid binding affinity of each mutant, and microscale thermophoresis (MST) assays 
were used to determine the quantitative value of the binding affinity. The results 
reveal that Leu4 and Leu6 are not important amino acids for nucleic acid binding. The 
wild-type PDCoV nsp9 protein bound to ssDNA with a Kg of 410 uM, and the ssDNA 
binding affinities of the nsp9-R7A, nsp9-G96E and nsp9-GI1O0E mutants are 
2.5~5-fold weaker than that of wild-type PDCoV_ nsp9. In addition, 
nsp9-L4A/L6A/R7A/N8A shows almost no binding affinity (Fig. 4A, 4B, 4C). The 


results of the EMSA and MST assays indicate that Arg7 in the N-finger and the 


Ayisueaiuc) yeoipeyy Bunisyoey Aq g1og ‘Zz eunr uo /Buo-wse'IAl//:diyjy Wo pepeojumMog 


(0) 
= 
6 
me) 
(0) 
— 
(72) 
O 
jae 
— 
(OL 
= 
O 
72) 
=) 
(= 
= 
TO 
ve 
[ok 
0) 
O 
O 
~s 


Journal of Virology 


Journal of Virology 


152 


153 


154 


155 


156 


157 


158 


159 


160 


161 


162 


163 


164 


165 


166 


167 


168 


169 


170 


171 


172 


glycines in the a-helix of PDCoV nsp9 play important roles in nucleic acid binding. 
Based on the sequence alignment, the N-finger motif and GXXXG motif of the 
protein are highly conserved (Fig. 1D). To determine whether these motifs are 
actually critical for the dimerization of other re-emerged coronavirus nsp9s, the 


PEDV nsp9 should be further investigated. 


The disulfide bond is critical for the dimerization of PEDV nsp9 


Two potential dimer interfaces are observed in the PEDV nsp9 crystal. One of the 
dimers observed in the crystals is formed by the parallel association of the C-terminal 
a-helices (Fig. 5A). The helices from the two monomers pack together closely 
because the heart of the dimer interface consists of three glycines (Gly95, Gly99 and 
Gly102), which form a hydrophobic interface and stabilize the dimer. The a-helix 
atoms of the two Gly95 residues show the closest distance of 3.3 A, and the distance 
between the two Gly102 residues is 10.3 A, with a very weak hydrophobic interaction 
(Fig. 5B). Compared with PDCoV nsp9, the N-finger of PEDV nsp9 may be flexible 
and cannot be built in the model. 

Another dimer of PEDV nsp9 forms a disulfide-linked homodimer (Fig. 5C), in 
which the two monomers are linked by the disulfide bond formed between the Cys59 
residues. Two f-strands, 4 and 5, are also involved in dimerization through the 
formation of two hydrogen bonds between the Cys59 side chain and the main chain of 
residue 61, two H-bonds between the Asn60 main chain and the side chain of residue 


61, and hydrophobic interactions among residues Lys52, Gly58, Cys59, Asn60 and 
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Thr61 (Fig. 5D). 

We applied biochemical techniques to determine the oligomeric state of PEDV 
nsp9 in solution. We performed AUC and confirmed the monomeric and dimeric 
states of PEDV nsp9, with calculated molecular weights of 14.22 kDa and 20.92 kDa, 
respectively. The sedimentation coefficients (S2o,w) are 1.489 and 1.927, respectively, 
and the frictional ratio (///o) is 1.421 (Fig. SE). The AUC assays show that monomers 
and dimers are the primary forms of PEDV nsp9 protein in solution, coexisting with a 
small number of higher oligomers. 

To determine whether Cys59 plays important roles in nsp9 dimer formation, one 
mutant, namely, nsp9-C59A, was designed. Then, the AUC assay was used to analyze 
the oligomeric states of the mutant. The AUC results for the nsp9-C59A mutant reveal 
a monomeric state, with a calculated molecular weight of 12.2 kDa, a measured 
sedimentation coefficient (S20,w) of 1.372, and a frictional ratio (//fo) of 1.393 (Fig. 
SE). Then, PEDV nsp9 and the nsp9-C59A mutant were treated with SDS-PAGE 
loading buffer without DTT, a reagent that can be used to disrupt the intramolecular or 
intermolecular disulfide bond formed by cysteines, and SDS-PAGE was performed 
(Fig. 5F). We used freshly prepared PEDV nsp9 protein from E. coli to perform the 
AUC and SDS-PAGE which have proved the disulfide bond may be formed in E. coli 
and indicated that the PEDV nsp9 dimer has a disulfide bond and Cys59 is an 
important residue for PEDV nsp9 dimer formation. In addition, to determine whether 
the PEDV nsp9 protein has the disulfide bond in mammalian cells, HEK293T cells 


were transfected with an empty vector or wild-type PEDV nsp9 plasmids. At 48 h 
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post-transfection, cells were treated with RIPA lysis buffer. Then the lysates were 
treated with SDS-PAGE loading buffer with or without DTT and immunoblotting was 
performed. The results show that the PEDV nsp9 is monomer in cells and indicate that 
the disulfide-bonded nsp9 dimer might be an artifact in the infected cells (Fig. 5G). 

To further validate the importance of the disulfide bond in dimer formation, we 
analyzed the structure of the nsp9-C59A mutant. The crystals belong to space group 
P432,2. The structure is determined by molecular replacement and refined to 3.0 A 
resolution (Table 1). The crystal structure of the nsp9-C59A mutant monomer also 
shows 7 B-strands and an a-helix appended at the C-terminus (Fig. 5H). Neither a 
disulfide bond nor any analogous dimer interface to that formed by the disulfide bond 
is observed in the structure. The structural superposition of the wild-type nsp9 
monomer with the nsp9-C59A mutant monomer yields 94 Ca atoms with an r.m.s. 
deviation of 0.2 A (Fig. 5H). 

Taken together, the crystal structure and biochemical assays reveal the crucial role 
of Cys59 in PEDV nsp9 dimer formation, and the disulfide-linked homodimer is the 
primary form of the PEDV nsp9 dimer. However, PEDV nsp9 is monomer in cells and 


the disulfide-bonded nsp9 dimer might be an artifact in the infected cells. 


The importance of dimerization-related amino acids in nucleic acid binding of 


PEDV nsp9 


To test whether Cys59 and the glycines in the a-helix (Gly95, Gly99, Gly102) are 
related to nucleic acid binding of PEDV nsp9, we designed three mutants, nsp9-C59A, 


nsp9-G95E/G99E/G102E and nsp9-C59A/G95E/G99E/G102E. Two EMSA assays 
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were performed to examine the nucleic acid binding affinity of each mutant. The 
ssDNA binding affinity of the nsp9-C59A mutant is somewhat weaker than that of the 
wild-type nsp9 protein, whereas the nsp9-G95E/G99E/G102E mutant shows very 
weak binding affinity, and the nsp9-C59A/G95E/G99E/G102E mutant shows almost 
no binding affinity (Fig. 6A, 6B). Furthermore, MST assays were performed to obtain 
quantitative measurements of the binding affinities. The results show that the 
wild-type PEDV nsp9 protein bound to ssDNA with a Ka of 145 uM, the nsp9-C59A 
mutant shows a 2.7-fold reduction in ssDNA binding, the nsp9-G95E/G99E/G102E 
mutant shows a 14-fold reduction, and the nsp9-C59A/G95E/G99E/G102E mutant 
shows much weaker binding affinity, with a 36-fold reduction (Fig. 6C). The results 
of the EMSA and MST assays indicate that Cys59 and the glycines in the a-helix of 


PEDV nsp9 play important roles in nucleic acid binding. 


The importance of positively charged amino acids in nucleic acid binding of 


PEDV nsp9 


The surface electrostatic potential analysis of PEDV nsp9 reveals a highly 
positively charged patch suggestive of a potential site for nucleic acid binding (Fig. 
7A). To determine whether this surface contributes to the ssDNA binding activity of 
PEDV nsp9, we introduced individual substitutions of positively charged residues, 
and their effects on ssDNA binding were assessed by EMSA and MST assays. In this 
study, residues Lys10, Arg68, Lys69 and Arg106 were mutated to alanine because of 
their highly positively charged potential and their exposure on the surface of the nsp9 


structure. One aromatic amino acid, Tyr82, was also mutated to alanine (Fig. 7A). The 
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EMSA results show that the nsp9-K10A, nsp9-R68A/K69A, and nsp9-R106A mutants 
show slightly weaker ssDNA binding activity, whereas the 
nsp9-K10A/R68A/K69A/R106A mutant reveals a significant loss of ssDNA binding 
affinity, and the nsp9-Y82A mutant shows clearly enhanced ssDNA binding affinity 
(Fig. 7B). The MST assay indicates that the nsp9-KI0A and nsp9-R106A mutants 
have 1.3-fold and 2.7-fold reductions in ssDNA binding affinity compared with the 
wild-type protein, respectively, whereas the nsp9-K10A/R68A/K69A/R106A mutant 
shows much weaker binding affinity, with a 7.2-fold reduction, and nsp9- Y82A shows 
much stronger binding affinity, with an 8.0-fold increase (Fig. 7C). The ssDNA 
binding activity of the nsp9-R68A/K69A mutant could not be measured by the MST 
assay because of the protein’s sedimentation characteristics. The results of the EMSA 
and MST assays indicate that the positively charged surface of the PEDV nsp9 does 


indeed contribute to the ssDNA interaction. 


Discussion 


Diverse dimerization of coronavirus nsp9 


In the emerging CoV PDCoV, we first confirm that the N-finger motif is critical 
for dimerization. Moreover, the dimerization of PDCoV nsp9 is also supported by the 
parallel association of the C-terminal a-helix via strong hydrophobic interactions. 
Although the monomeric structures of nsp9 from PDCoV exhibited significant 
similarity to those from other CoVs, the dimerization modes showed high diversity. 


IBV nsp9 forms a homodimer via interactions across a hydrophobic interface, which 
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consists of two parallel alpha-helixes near the carboxy terminus of the protein (25). 
For the SARS-CoV nsp9 protein, the N-terminal extended B chain and the C-terminal 
a-helices stabilize its interface (24). The N-terminal extended B chain of SARS nsp9 
is supported by the extended tag, which may lead to the unclear function of its 
N-finger motif. Here, SARS-CoV nsp9-AN7 mutant is monomeric in solution which 
is consistent with wild-type PDCoV nsp9-AN7 (data not shown). The N- and 
C-termini of the protein are more conserved than the central core region, and the 
GXXXG motif is strictly conserved (24, 26). In other proteins, mutation of the Gly 
residues in the motif led to the decreased or the complete loss of dimerization of 
transmembrane helices (21, 27, 28). The GXXXG motif of nsp9 is important for 
maintaining the dimer form. PDCoV nsp9 AN7 without the N-finger forms the 
monomeric stage. We conclude that the N-finger motif and the GXXXG motif are 
both important for dimerization of PDCoV nsp9. 

In the dimer interface of PEDV nsp9, similar with HCoV-229E nsp9, there is a 
disulfide bond. However, the cysteine is located at a different position, resulting in a 
very different dimer conformation. PEDV nsp9 primary forms a disulfide-linked 
homodimer, in which the two monomers are linked by the disulfide bond formed 
between the Cys59 residues. For HCoV-229E nsp9, a disulfide bond is involved in 
dimerization (23), which is consistent with our results. However, the weak effect of 
the C59A mutation on nucleic acid binding indicate that dimerization of PEDV nsp9 
by disulfide formation may not be a major form in the infected cells. Moreover, the 


overall milieu is reductive, and disulfide bonds are rare in the cytoplasm of the 
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infected cell (29). The biological function of the disulfide bond should be further 


investigated by a reverse genetics system. 


Nucleic acid binding by nsp9 


The crystal structures of CoV nsp9s_ share great similarity in the 
oligosaccharide-binding (OB) fold, which is characteristic of proteins that bind to 
ssDNA or ssRNA (30). Our data show that PDCoV nsp9 and PEDV nsp9 both have 
binding activity with nucleic acids, which is consistent with other CoV nsp9 proteins 
(21, 23, 25). In addition, the dimeric state of the PDCoV nsp9 and PEDV nsp9 can 
increase the binding activity with nucleic acids. Our EMSA and MST experiments 
also suggest that the N-finger of PDCoV nsp9 can influence the affinity to nucleic 
acids. In a previous study, the SARS-CoV nsp9 dimer was shown to be required for 
viral growth (21). The nsp9 dimerization is critical for the function of this replicase 
protein for the IBV infectious clone system (31). We hypothesize that the dimer of 
PDCoV nsp9 and PEDV nsp9 may be critical for viral replication. Whether the 
N-finger of PDCoV nsp9 or the cysteine of PEDV nsp9 affects viral replication 
should be further studied by using reverse genetic approaches. 

Nucleotides interact with proteins primarily via aromatic amino acid side chains, 
hydrophobic side chains or the aliphatic portions of polar and positively charged 
groups, such as lysine and arginine (30). A previous study of SARS-CoV nsp9 
suggested that the most likely site of RNA binding is on the loops L23, L45 and L7H1, 


as this face is accessible in the preferred helix-stabilized dimer but largely occluded in 
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the putative B sheet-stabilized dimer (24). Based on the structure of SARS-CoV nsp9, 
we observe that positively charged amino acids are present on the loops. In our study, 
the PEDV nsp9-K10A, nsp9-R68A/K69A and nsp9-R106A mutants mildly affect 
ssDNA binding activity, whereas the nsp9-K10A/R68A/K69A/R106A mutant reveals 
a significant reduction in ssDNA binding (Fig. 7B, 7C), which confirmed the 
importance of highly positively charged residues exposed on the protein surface in 
ssDNA binding. Many amino acids in a single protein may participate in ssDNA 
binding, and thus, it is reasonable that a single-residue mutation could have a 
moderate effect on ssDNA binding activity. We hypothesize that the potential site of 
RNA binding is on the face of PEDV nsp9 present on strands B1 and B6 (Fig. 1B). 
Surprisingly, the nsp9-Y82A mutant strongly enhances the ssDNA binding affinity 
(Fig. 7B, 7C) and L6A and N8A of PDCoV nsp9 unexpectedly induced an 
enhancement of the ssDNA binding affinity, which may be caused by 
decreasing steric hindrance. Whether the nsp9-Y82A mutant affects viral replication 
should be further investigated. In addition, we have attempted to solve the structure of 
the nsp9-RNA or DNA complexes. Unfortunately, we were unable to obtain the 
crystals of the nsp9-RNA or DNA complexes. 

Nsp9 is able to bind ssDNA or ssRNA (22, 25), although binding of ssRNA is 
expected to be the native function. In the infected cell, the coupling/compartmentation 
of the viral RNA synthesis with the RNA-binding function of nsp9 might render RNA 
versus DNA specificity unnecessary (22). Moreover, as a member of the replication 


complex, nsp9 may not have a specific binding sequence but may act in conjunction 
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with other nsps as a processivity factor (21). According to a model for the initiation of 
coronavirus negative-strand RNA synthesis, a protein complex (including nsp8 and 
nsp9) binds to viral RNA initially (32). The potential site of RNA binding may 
provide a new insight into the binding of the replication/transcription complexes to 
viral RNA. 

CoV nsp9 has diverse dimerization, while its function of enhancement nuclear 
binding affinity is conservative (22-25). We propose that most nsp9 proteins 
originated from ancestral coronavirus, which have N-finger motifs and GXXXG 
motifs that both play critical roles in dimerization. The N-terminal loops and GXXXG 
motifs of PDCoV, IBV and SARS-CoV nsp9 are also involved in dimerization, while 
the extra amino acids (the residues of fusion tag) in the N-finger may block the fully 
function of the N-finger (22, 24, 25). Interestingly, PEDV and HCoV-229E evolve 
different disulfide bonds to enhance the dimerization (23). Using diverse dimerization 
strategies, nsp9 might increase the nucleic acid binding interface and then promote its 
nucleic acid binding affinity, which might stabilize nascent viral RNAs during 
replication or transcription, thus providing protection from nucleases (22, 24). 

In summary, CoV nsp9s have diverse forms of dimerization that promote their 
biological function, which may help elucidate the mechanism underlying CoVs 


replication and contribute to the development of antiviral drugs. 


Materials and methods 


Protein production and crystallization 
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The full-length nsp9 genes of PDCoV and PEDV were amplified by PCR from 
virus-derived cDNA fragments and cloned separately into pET-42b. Both nsp9s were 
expressed with a C-terminal Hise tag in E. coli BL21 (DE3). High-quality soluble 
PDCoV nsp9 protein and PEDV nsp9 protein were obtained by adding 1 mM IPTG 
(isopropyl-B-D-thiogalacto-pyranoside) to LB when the bacteria had grown to an 
optical density at 600 nm of approximately 0.6 at 37 °C, followed by shaking at 37 °C 
for 4 h and at 27 °C for 10 h, respectively. For protein purification, the E. coli cells 
were harvested, resuspended with phosphate-buffered saline (PBS), and lysed via 
passage through an AH-1500 homogenizer. After centrifugation at 8,500 rpm for 40 
min, the supernatant was filtered and loaded onto a HisTrap HP column (GE 
Healthcare). The PDCoV nsp9-AN7, PEDV nsp9-CS59A and other nsp9 mutants were 
expressed and purified as described above. 

Crystallization screening was performed by the sitting drop vapor diffusion 
method at 20 °C. PDCoV nsp9 (5.0 mg/ml) was mixed at a 1:1 ratio with 
crystallization cocktails. Wild-type protein crystals and Se-Met-labeled nsp9 crystals 
appeared within 3 days at 20 °C in 2.0 M sodium chloride and 0.1 M citric acid (pH 
3.5). The PDCoV nsp9-AN7 mutant protein crystals appeared within 3 days at 20 °C 
in 0.2 M sodium chloride, 0.1 M NazHPOa:citric acid (pH 4.2) and 20% (w/v) PEG 
8000. After optimization, the wild-type crystals or Se-Met-labeled nsp9 crystals were 
stabilized in 3.0 M sodium chloride and 0.1 M citric acid (pH 3.5), and the nsp9-AN7 
crystal was stabilized in 0.1 M Na ,HPOs,:citric acid (pH 4.2) and 22% (w/v) PEG 


8000. Meanwhile, PEDV nsp9 (6.5 mg/ml) was mixed at a 1:1 ratio with 
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crystallization cocktails. Wild-type protein crystals appeared within 40 days at 20 °C 
in 0.1 M Tris (pH 8.5) and 25% v/v tert-butanol, and PEDV nsp9-C59A mutant 
protein crystals appeared within 2 days at 20 °C in 0.2 M sodium thiocyanate (pH 6.9) 
and 12% PEG 3350. After optimization, the wild-type crystals were stabilized in 0.3 
M Tris (pH 8.5) and 13% v/v tert-butanol, and the nsp9-C59A mutant crystals were 
stabilized in 0.25 M sodium thiocyanate (pH 7.7), 10% PEG 3350, and 2% ethylene 


glycol. There was no reductant (DTT) in the crystallization set-up. 


Data collection and structure determination 


X-ray diffraction data were collected at beamline BLI7U at the Shanghai 
Synchrotron Radiation Facility. The structure of PDCoV nsp9 was solved using the 
single-wavelength anomalous dispersion (SAD) method and a Se-Met-derivative nsp9 
wild-type protein. Both potential selenium atoms in the nsp9 monomer were located, 
and the initial phases were calculated using the AutoSol program from the PHENIX 
software suite (33). The molecular replacement program HKL-3000 (34) was used to 
solve the structure of PEDV nsp9, with a monomer of HCoV-229E nsp9 (PDB 
ID:2J97) as the search model. Manual model rebuilding was performed using COOT 
(35) and then refined in the PHENIX software suite. The figures were created using 


PyMOL (Schrodinger). 


Structural analysis and sequence alignment 


Detailed molecular interactions between the two monomers of nsp9 were 


determined using LIGPLOT (36), and the other structure figures were generated using 
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PyMOL (Schrédinger). The buried surface areas between the two monomers and the 
root mean square deviation (RMSD) were analyzed using PDBePISA 
(http://pdbe.org/pisa/) and PDBeFold (http://pdbe.org/fold/), respectively. Additionally, 
the amino acid sequences of coronavirus nsp9 were aligned using ClustalW2 (37) and 


visualized with the ESPript 3 server (http://espript.ibcp.fr) (38). The analyzed viruses 


(abbreviation; NCBI accession number) were as follows: PEDV (porcine epidemic 
diarrhea virus; AHZ94880), PDCoV (porcine delta coronavirus; APG38197), TGEV 
(transmissible gastroenteritis virus; AML22776), HCoV-229E (human coronavirus 
229E; NP_073550), SARS-CoV (SARS coronavirus Shanhgai LY; AAP82976), 
MERS-CoV (middle east respiratory syndrome-related coronavirus; AHI48749), 
MHV (murine hepatitis virus strain A59; ACO72881), IBV (Infectious bronchitis 


virus; ACJ12833). 


Size-exclusion chromatography (SEC) 


Oligomerization of wild-type (0.1 mg) and mutant (AN7 and 
L4A/L6A/R7A/N8A) (0.1 mg) PDCoV nsp9 proteins was analyzed using a Superdex 
75 10/300GL column (GE Healthcare) with a buffer containing 20 mM Tris-HCl (pH 
7.4) and 200 mM NaCl at a flow rate of 0.6 ml/min (4 °C). Wild-type and mutant 
(AN7 and L4A/L6A/R7A/N8A) nsp9 proteins eluted in different fractions were 


analyzed by SDS-PAGE. 


Immunoblotting 


To obtain high expression in eukaryotic cells, PEDV nsp9 wild-type, flanked by 
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an C-terminal hemagglutinin (HA) tag, was cloned into pCAGGS vector using the 
EcoRI and XhoI restriction sites. 

HEK293T cells cultured in 6-well plates (Corning, Tewksbury, MA, USA) were 
transfected with an empty vector or wild-type PEDV nsp9 plasmids (2.5 ug) using 
Lipofectamine 2000. At 48 h post-transfection, cells were treated with RIPA lysis 
buffer (Beyotime). Then the lysates were treated with SDS-PAGE loading buffer with 
or without DTT. For the immunoblot analysis, 10 ul of lysate was electrophoresed on 
12% SDS-PAGE gels (Bio-Rad). Proteins were transferred to PVDF membranes 
(Bio-Rad) according to the manufacturer's recommendations. Membranes were 
incubated in blocking buffer (5% nonfat dried milk in TBS-Tween [50 mM Tris, pH 
7.6, 150 mM NaCl)) for 3h at room temperature (RT). Anti-HA antibody (Ab; MBL) 
was diluted 1:10,000 in blocking buffer and incubated with the membrane for 3h at 
RT. Membranes were washed three times for 15min at RT with TBS-Tween. Goat 
anti-mouse horseradish peroxidase-conjugated antibodies (BOSTER) were diluted 
1:5,000 in blocking buffer and incubated with the membrane for 1h at RT. Membranes 
were washed three times for 15 min at RT with TBS-Tween. Then, the membranes 
were visualized using an enhanced chemiluminescence system (Amersham Imager 


600, GE Healthcare). 


Analytical ultracentrifugation (AUC) 


Sedimentation velocity was assessed with an XL-I analytical centrifuge (Proteome 


Lab, Huazhong Agricultural University, Wuhan) at 60,000 rpm and 18 °C in 400 ul 
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double sector cells for sedimentation analytical ultracentrifugation. The sedimentation 
boundary was monitored every minute at a wavelength of 280 nm. Samples were at a 
concentration of 1 mg/ml in 20 mM Tris, 200 mM NaCl (pH 7.4) buffer. Data were 
interpreted with the model-based distribution of Lamm equation solutions c(s) using 
the software Sedfit (39). Weight-averaged molar masses were calculated using Vector 


NTI software. 


Electrophoretic mobility shift assay (EMSA) 


An electrophoretic mobility shift assay was performed as described previously (40). 
A 5’-Cy5-labeled 36-mer ssDNA oligonucleotide 
(TAGTACCGCCACCCTCAGAACCTTTTTTTTTTITTIT) (synthesized by Sangon 
Biotech, Shanghai, China) was used for the gel shift assay (22). Initially, 5 uM ssDNA 
was incubated with different concentrations of PDCoV and PEDV nsp9s and the 
mutant proteins in 10 mM Tris (pH 8.0) and 200 mM NaCl for 45 min at room 
temperature, followed by the addition of 10% glycerol to the mixture. Samples were 
then run on 6.5% nondenaturing TBE polyacrylamide gels for 37 min at a voltage of 
180V, and the results were determined with an FLA-2000 fluorescent image analyser 
(Fuji, Stamford, CT). 

Zone-interference gel electrophoresis was described in the publication by 
Ponnusamy et al (23). The ssDNA was dissolved at a concentration of 2 nmol before 
loading onto a 1% agarose gel. The gel was run for 50 min with running buffer (20 


mM Tris, 50 mM boric acid, 1 mM NaEDTA, pH 8.3) at 100 mA, 4°C. With the poles 
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of the electrodes interchanged, 1 nmol nsp9 protein was loaded and run for another 50 
min. After electrophoresis, the gel was fixed in 3.5% (w/v) a-sulfosalicilic acid, 10% 
(w/v) trichloroacetic acid, until the dye turned yellow. For the detection of protein 
bands, the gel was washed for 15 min in 15% (v/v) ethanol, 8% (v/v) acetic acid, and 
stained for 30 min with 0.25% (w/v) Coomassie brilliant blue in the same solution 
with additional 10% (v/v) methanol. The gel was washed in 15% (v/v) ethanol, 8% 


(v/v) acetic acid, and stored in 10% (v/v) acetic acid. 


Microscale thermophoresis (MST) assay 


Specific binding between nsp9 or the nsp9 mutant proteins and ssDNA was 
measured by MST assays as previously described (41, 42). Briefly, nsp9 or nsp9 
mutant proteins were serially diluted with buffer containing 10 mM Tris (pH 8.0) and 
200 mM NaCl, and then, 10 nM Cy5-labeled ssDNA diluted with buffer containing 10 
mM Tris (pH 8.0), 200 mM NaCl and 0.05% Tween-20 was added. The mixtures were 
loaded into NT.115 standard capillaries (Nanotemper Technologies), and 
thermophoresis was carried out at room temperature, 100% LED, and 20% IR-laser 
power with a Monolith NT.115 (Nanotemper Technologies). The data were analyzed 


with the Nanotemper Analysis software v.1.2.101. 


Accession numbers 


The atomic coordinates of wild-type PDCoV nsp9, PDCoV nsp9-AN7, wild-type 
PEDV nsp9 and the nsp9-C59A mutant have been deposited in the RCSB Protein 


Data Bank with the accession codes 5YM6, 5YM§8, 5HIZ and 5HIY, respectively. 
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610 Figure legends 


611 Figure 1. Stereoview of the structures of PDCoV nsp9 and PEDV nsp9. (A) 


612 Ribbon representation of the PDCoV nsp9 monomer. (B) Ribbon representation of the 
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613. PEDV nsp9 monomer. (C) Schematic illustration of nsp9 topology. The 6 strands are 

614 drawn as arrows and a-helices as cylinders. (D) Sequence alignment of CoV nsp9 

615 homologs to PDCoV nsp9 and PEDV nsp9. Identical residues are highlighted in red, 

616 and conserved residues are shown in yellow. The table was produced using ESpript3.0 

617 —_with the secondary structure elements for PDCoV nsp9 and PEDV nsp9 assigned 

618 using DSSP (43). Residues boxed in red are completely conserved. 

619 Figure 2. The N-finger of the PDCoV nsp9 has a strong effect on dimerization. (A) 
620 The two monomers are colored cyan and magenta, respectively, and depicted in 


621 cartoon and surface representations. (B) 2Fo — F electron density map of the N-finger 
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at 1.50. This region includes ten residues. (C) Dimerization interface of PDCoV nsp9. 
The interacting residues (Leu4, Arg7, Arg70, Gly96, Gly100 and Serl03) between 
two monomers are shown with stick illustrations. (D) All the interacting residues 
between subunits A (blue) and B (magenta) were determined using LIGPLOT. (E) 
Calculated molecular weights of the PDCoV nsp9 protein peaks with the values 
obtained for known calibration standards (GE Healthcare). The calculated molecular 
weight of the PDCoV nsp9 peaks was determined by fitting to the calibration curve 
(K,y=volumes of elution [Ves]/24); the volumes of elution of 10.87 ml (approximately 
25.33) and 12.49 ml (approximately 12.64) are indicated by arrows. (F) Velocity AUC 
analysis of the PDCoV nsp9 and mutant proteins. The molecular mass represented by 
each major peak is indicated, and the biophysical data are shown in the table. (G) Size 
exclusion experiment with the wild-type and mutants (AN7 and L4A/L6A/R7A/N8A) 
of PDCoV_ nsp9. (H) SDS-PAGE analysis of wild-type and mutant 
(L4A/L6A/R7A/N8A and AN7) nsp9. The elution volume is described in panel E. 
Molecular mass markers are shown. 

Figure 3. Illustration of the superimposition of PDCoV nsp9-WT and PDCoV 
nsp9-AN7 monomer structures. The PDCoV nsp9-WT and PDCoV nsp9-AN7 
monomers are colored orange and blue, respectively, and depicted in cartoon 
representation. The structural superposition shows a root mean square deviation 
(RMSD) of 0.9 A between the 90 Ca atoms. 

Figure 4. The N-finger motif can affect the ssDNA binding abilities of PDCoV 


nsp9. (A) The ssDNA binding abilities of nsp9-WT, nsp9-AN7, nsp9-G96E, 
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nsp9-G100E, nsp9-L4A, nsp9-L6A, nsp9-R7A, nsp9-N8A and 
nsp9-L4A/L6A/R7A/N8A as determined by EMSA. Each protein was assayed at 
different concentrations (0 uM, 25 uM, 50 uM) with 5 uM 5’-Cy5-labeled 36-mer 
ssDNA. (B) Zone-interference gel electrophoresis illustrating the association of 
nsp9-WT, nsp9-AN7 and nsp9-L4A/L6A/R7A/N8A with 36-mer ssDNA. The red 
dotted lines represent the position of the bands of nsp9-WT, nsp9-AN7 and 
nsp9-L4A/L6A/R7A/N8A without reaction with ssDNA. (C) The ssDNA binding 
affinities of nsp9-WT and mutants were measured by MST assays. Each protein was 
assayed in twofold concentration steps with 10 nM 5’-Cy5-labeled 36-mer ssDNA. 
The measured Kg of each protein is shown in the table. 

Figure 5. Two potential types of PEDV nsp9 dimers and the crucial role of 
cysteine. (A) Orthogonal views of the helix dimer. The two monomers are colored 
yellow and light blue and are depicted in cartoon and surface representations. (B) The 
heart of the dimer interface is formed from three glycines (Gly95, Gly99 and Gly 102). 
(C) Orthogonal views of the disulfide-linked dimer. The two monomers are colored 
purple and split-pea and are depicted as cartoon and surface representations. (D) The 
disulfide bond is depicted as a magenta dotted line and the hydrogen bonds as red 
dotted lines, and residues located in the dimer interface are labeled in green. (E) 
Velocity AUC analysis of the PEDV nsp9 and nsp9-C59A mutant proteins. The 
molecular mass represented by each major peak is indicated, and the biophysical data 
are shown in the table. (F) SDS-PAGE analysis of freshly prepared proteins treated 


with 5xSDS-PAGE loading buffer without DTT. The differences between nsp9 and 
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the nsp9-C59A mutant proteins were detected on polyacrylamide gels. (G) HEK293T 
cells were transfected with an empty vector or wild-type PEDV nsp9 plasmids. At 48h 
post-transfection, the cell lysates were treated with SDS-PAGE loading buffer with or 
without DTT and subjected to Western blotting. (H) Illustration of the superimposition 


of PEDV nsp9 and PEDV nsp9-C59A monomeric structures. 


Figure 6. The disulfide bond can affect the ssDNA binding abilities of PEDV nsp9. 


(A) The ssDNA binding abilities of nsp9-WT, nsp9-C59A, nsp9-G95E/G99E/G102E, 
and nsp9-C59A/G9S5E/G99E/G102E as determined by EMSA. Each protein was 
assayed at different concentrations (0 uM, 0.41 uM, 0.82 uM, 1.64 uM, 3.28 uM, 6.56 
uM, 10 uM) with 5 uM 5’-Cy5-labeled 36-mer ssDNA. (B) Zone-interference gel 
electrophoresis illustrating the association of nsp9-WT, nsp9-C59A and nsp9- 
C59A/G95E/G99E/G102E with 36-mer ssDNA. The red dotted lines represent the 
position of the bands of nsp9-WT, nsp9- C59A and nsp9- C59A/G95E/G99E/G102E 
without reaction with ssDNA. (C) The ssDNA binding affinities of nsp9-WT, 
nsp9-C59A, nsp9-G95E/G99E/G102E, and nsp9-C59A/G95E/G99E/G102E were 
measured by MST assays. Each protein was assayed in twofold concentration steps 
with 10 nM 5’-Cy5-labeled 36-mer ssDNA. The measured Ky of each protein is 
shown in the table. 

Figure 7. Impact of positively charged amino acid substitutions on ssDNA 
binding of PEDV nsp9. (A) Depictions of the electrostatic potential surface of the 
PEDV nsp9 dimer created by APBS tools. The mutated amino acids exposed on the 


protein surface are labeled. (B) The ssDNA binding abilities of nsp9-WT, nsp9-K10A, 
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nsp9-R68A, K69A, nsp9-R106A, nsp9-K10A/R68A/K69A/R106A and nsp9-Y82A as 
determined by EMSA. Each protein was assayed at different concentrations (0 uM, 
0.41 uM, 0.82 uM, 1.64 uM, 3.28 uM, 6.56 uM, 10 uM) with 5 uM 5’-Cy5-labeled 
36-mer ssDNA. (C) The ssDNA binding affinities of nsp9-WT, nsp9-K10A, 
nsp9-R106A, nsp9-K10A/R68A/K69A/R106A and nsp9-Y82A as measured by MST 
assays. Each protein was used in twofold concentration steps with 10 nM 
5’-Cy5-labeled 36-mer ssDNA. The measured Ky of each protein is shown in the 


table. 
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Table 1 Statistics of data collection and refinement 


Value(s)* 


Wild-typePDCoVnsp9 


PDCoV nsp9-AN7 


Wild-typePED Vnsp9 


PEDVnsp9-C59A 


Data collection 


Space group 


Cell parameter [a, b, c (A)] 


Euo>a, Bs y(°) 
Wavelength (A) 


Resolution (A) (range) 


Completeness (%) 
Rrnerge (%) 
Z/o(last shell) 


Redundancy (last shell) 


Refinement 
Resolution (A) (range) 
Ruvork! Riree (%) 

No. reflections 

No of protein atoms 
No. of solvent atoms 
No. of molecules 
RMSD 

Bond length (A) 

Bond angle (°) 


Avg B factor (A’) 


Ramachandranplot:core,allow, 


disallow (%) 


P2, 
57.97,55.19,69.44 
90,00,90.01, 90.00 
0.97917 

29.79 — 1.80 
97.8(97.0) 
6.4(46.6) 
15.71(3.38) 


3.8(3.8) 


29.79 — 1.80 
20.6/24.6 
39706 

2970 


207 


33 


97.29,2.71,0.00 


P2, 
31.55,83.86,36.34 
90.00,106.08,90.00 
0.97917 

34.92 — 1.99 
99.7(100.0) 
7.5(10.3) 
24.51(23.27) 


7.0(7.2) 


34.92 — 2.00 
21.3/24.8 
12364 

1463 


70 


20 


97.81,2.19,0.00 


P64 
73.87, 73.87, 91.55 
90.00,90.00,120.00 
0.97917 

28.75 -2.89 

99.4 (99.2) 
6.3(67.7) 
35.33(3.00) 


11.1(11.4) 


28.75 — 2.90 
30.7/32.7 
6340 


1498 


109 


92.22, 6.67,1.11 


P4322 
58.73, 58.73, 193.24 
90.00, 90.00, 90.00 
0.97917 

38.15 — 3.00 
99.9(100.0) 
5.9(84.9) 

29.8(3.74) 


7.6(7.7) 


38.15 — 3.00 
29.5/33.5 
6925 

2249 


0 


103 


95.40, 3.83,0.77 
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“Highest resolution values are written in parenthesis. 

PR sive =X /J-<I>/Zz J (where is J; the intensity measurement of reflection h and <I> is 
theaverage intensity from multiple observations). 

“Rwork = = ||Fol - lFe|| / Z |Fol; where F, and F, are the observed and calculated structure 
factorsrespectively; Rfree is equivalent to Ryork but where 5% of the measured reflections have been 


excluded fromrefinement and set aside for cross-validation. 
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B al 
pl 
p6 

> 
2 N-finger Bl B2 B3 B4 
a PDCovVv . PTT a ————— > ——<——a> TT 
S PEDV ——— SS —=— == —- 
me) PEDV 1 NE QRSIKAE..GDG.IVGEGKALHYNINE GIGRT FMYPAF I|SIDKPD VVIKWEFDG 
re PDCoV 1 a TAQNTAQ. .DFNGNESTVKSFY VITRAIGK K ILVENT TISSTKDN TVMICLTET 
= TGEV 1 M ERAVRASATLDGEAFGSGKALMA\SE SIGK SFMYPAF IIAISDNN YVIKWESNN 
2 HCoV-229E 1 M VKATKGE..GDGGITSEGNALIYNNE GG@RAFMYIAY VITITKPG YWIKWEHDS 

SARS-CoV 1 S} OMSCAAGTTQTACTDDNALAYIYININS KIGGREVLENL LISID HOD WARFPKSD 

MERS-CoV 1 K TMVVSAGQEQTNCNT.SSLAY|YEIP!V QIGRKMLIMPIAL LISIDNAY WARVEGKD 

MHV 1 M TOVVNSGSD.MNCNT.PTQOCY|¥NITT GTIGK IVJYPAT LISID CDG Y/T|IKIVKED 

IBV 1 M TKACVAGVDQAHCSV.ESKCY|¥TIS/I SIGS S VVJARNI TISISNPN VIAISF LNEA 

BS B6 

PDCoV = —_—_—P —> 

PEDV =—>P —=> ————j>——TT 

PEDV 58 ae CNT L MmLVDSPNGAQ 

PDCoV 59 oe Re VI L IMAHTVGGKQS 

TGEV 61 oP L mY VDGANGPE 

HCoV-229E 59 ca VD L mVIDTPTITGPQ 

SARS-CoV 61 |GTGTIY L mVTDTPKGPK 

MERS-CoV 60 --FVS L mLIAGPKGPE 

MHV 59 N.CVV L MSVQDVKGLK 

IBV 60 |GN.QILY L MIGMKVGDKVE 
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05 


Predicted MW (kDa) AUC 


Monomer 
on (12.49mL,12.64kDa) Construct Mononmer Dimer //f0 S20.0(S) MW(kDa) 
5 a (monomer/dimer) —(monomer/dime 
. nsp9-WT 11.99 23.98 1.423 1.369/2.058 12.56/23.14 
nsp9-AN7 11.22 23.98 1.317 1,349/— 10.94/— 

$2 (10.87mL,25.33kDa) N-finger nsp9-R7A 11.99 23.98 1.373 1.396/1.961 12.26/20.41 
nsp9-L4A/L6A/R7A/N8A. 11.99 23.98 1.368 1.316/2.011 11.16/21.07 

si a-helical nsp9-G96E 11.99 23.98 1.368 1.349/1.963 11.58/20.32 
nsp9-G100E 11.99 23.98 1.367 1.348/2.024 11.56/21.26 
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WT 410 
AN7 794 
G96E 771 
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Predicted MW (kDa) AUC 
Construct S20.w(S) MW(kDa) 
Mononmer Dimer //f0 
(monomer/dimer) (monomer/dimer) 
nsp9-WT 12.09 24.18 1.421 1.489/1.927 14.22/20.92 
nsp9-C59A 12.09 24.18 1.393 1.372/— 12,.2/— 
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ean iim) 


C59A G9S5E/G99E/G102E C59A/G95E/G99E/G102E 


Ses f°) mee 2 Wa ed a ha dh he 


e 
RS o 
~ xa 
Fs 
Ps Re 
MST 

PEDV nsp9 mutants Kad(uM) SD 
WT 145 7.1 
C59A 396 11.0 
G95E/G99E/G102E 2050 42.6 
C59A/G95E/G99E/G102 5240 106 
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Y82A 


K10A/R68A/K69A/R1IO6A 
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